VPsearch: fast exact sequence similarity search for genomic sequences

نویسندگان

چکیده

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A fast algorithm for exact sequence search in biological sequences using polyphase decomposition

MOTIVATION Exact sequence search allows a user to search for a specific DNA subsequence in a larger DNA sequence or database. It serves as a vital block in many areas such as Pharmacogenetics, Phylogenetics and Personal Genomics. As sequencing of genomic data becomes increasingly affordable, the amount of sequence data that must be processed will also increase exponentially. In this context, fa...

متن کامل

SW#db: GPU-Accelerated Exact Sequence Similarity Database Search

In recent years we have witnessed a growth in sequencing yield, the number of samples sequenced, and as a result-the growth of publicly maintained sequence databases. The increase of data present all around has put high requirements on protein similarity search algorithms with two ever-opposite goals: how to keep the running times acceptable while maintaining a high-enough level of sensitivity....

متن کامل

Querying Timestamped Event Sequences by Exact Search or Similarity-based Search: Design and Empirical Evaluation

Specifying timestamped event sequence queries is challenging even for skilled computer professionals familiar with SQL. Most graphical user interfaces for database search use a exact search approach, which is often effective, but applies an exact match criteria. We describe a new similarity-based search interface, in which users specify a query by simply placing events on a blank timeline and r...

متن کامل

Similarity Search In Sequence

We propose an indexing method for time sequences for processing similarity queries. We use the Discrete Fourier Transform (DFT) to map time sequences to the frequency domain, the crucial observation being that, for most sequences of practical interest, only the rst few frequencies are strong. Another important observation is Parseval's theorem, which speciies that the Fourier transform preserve...

متن کامل

Similarity Search for Multidimensional Data Sequences

Time-series data, which are a series of one-dimensional real numbers, have been studied in various database applications. In this paper, we extend the traditional similarity search methods on time-series data to support a multidimensional data sequence, such as a video stream. We investigate the problem of retrieving similar multidimensional data sequences from a large database. To prune irrele...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of open source software

سال: 2022

ISSN: ['2475-9066']

DOI: https://doi.org/10.21105/joss.04236